Learning TRECVID'08 High-Level Features from YouTube

نویسندگان

  • Adrian Ulges
  • Christian Schulze
  • Markus Koch
  • Thomas M. Breuel
چکیده

Run No. Run ID Run Description infMAP (%) training on TV08 data 1 IUPR-TV-M SIFT visual words with maximum entropy 6.1 2 IUPR-TV-MF SIFT with maximum entropy, fused with color+texture and motion (NN matching) 5.9 3 IUPR-TV-S SIFT visual words with SVMs 5.3 4 IUPR-TV-SF SIFT with SVMs, fused with color+texture and motion (NN matching) 6.3 training on YouTube data (no use of standard training sets) 5 IUPR-YOUTUBE-S SIFT visual words with SVMs 2.2 6 IUPR-YOUTUBE-M SIFT visual words with maximum entropy 2.1 We participated in TRECVID’s High-level Features task [17] to investigate online video as an alternative data source for concept detector training. Such video material is publicly available in large quantities from portals like YouTube. In our setup, tags provided by users during video upload serve as weak ground truth labels, such that thousands of concepts can be learned without manual annotation effort. On the downside, online video as a domain is complex, and the labels associated with it are coarse and unreliable, such that performance loss can be expected compared to high-quality standard training sets. To find out if it is possible to train concept detectors on web video, our TRECVID experiments compare state-of-the-art (visual only) concept detection systems when (1) training on the standard TRECVID development data and (2) training on clips downloaded from YouTube. Our key observation is that YouTube-based detectors work well for some concepts, but are overall significantly outperformed by the “specialized” systems trained on standard TRECVID’08 data (giving a infMAP of 2.2% and 2.1% compared to 5.3% and 6.1%). An in-depth analysis shows that a major reason for this seems to be redundancy in the TV08 dataset.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

DFKI-IUPR participation in TRECVID’09 High-level Feature Extraction Task

Run No. Run ID Run Description infMAP (%) training on TV09 data (type: A) 1 IUPR-VW-TV SIFT visual words with SVMs 8.5 2 IUPR-ADAPT-TV SIFT visual words with PA1SD 5.1 combined training on YouTube and TV09 data (type: C) 3 IUPR-VW+TT-TV SIFT visual words with SVMs, fused with TubeTagger concept detection scores 8.3 4 IUPR-ADAPT-YT SIFT visual words with PA1SD, trained on YouTube, adapted to TV0...

متن کامل

TRECVID 2010 Known-item Search by NUS

This paper describes our system for auto search and interactive search in the known-item search (KIS) task in TRECVID 2010. KIS task aims to find an unique video answer for each text query. The shift from traditional video search has prompted a series of challenges in processing and searching techniques that developed over the past few years. For the automatic search task, our VisionGo system p...

متن کامل

LIG and LIRIS at TRECVID 2008: High Level Feature Extraction and Collaborative Annotation

This paper describes our participations of LIG and LIRIS to the TRECVID 2008 High Level Features detection task. We evaluated several fusion strategies and especially rank fusion. Results show that including as many low-level and intermediate features as possible is the best strategy, that SIFT features are very important, that the way in which the fusion from the various low-level and intermed...

متن کامل

MSRA atT TRECVID 2008: High-Level Feature Extraction and Automatic Search

This paper describes the MSRA experiments for TRECVID 2008. We performed the experiments in high-level feature extraction and automatic search tasks. For high-level feature extraction, we representatively investigated the benefit of global and local low-level features by a variety of learning-based methods, including supervised and semi-supervised learning algorithms. For automatic search, we f...

متن کامل

VideoSense at TRECVID 2011 : Semantic Indexing from Light Similarity Functions-based Domain Adaptation with Stacking

This paper describes our participation to the TRECVID 2011 challenge [1]. This year, we focused on a stacking fusion with Domain Adaptation algorithm. In machine learning, Domain Adaptation deals with learning tasks where the train and the test distributions are supposed related but different. We have implemented a classical approach for concept detection using individual features (low-level an...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2008